Fara 7B

About the Provider

Microsoft is a global technology company and AI platform provider, building large-scale AI systems, research models, and developer tools. Through Microsoft Research and its cloud ecosystem, the company supports open and enterprise AI development across productivity, cloud, and intelligent agent technologies

Model Quickstart

This section helps you quickly get started with the microsoft/Fara-7B model on the Qubrid AI inferencing platform. To use this model, you need:

A valid Qubrid API key
Access to the Qubrid inference API
Basic knowledge of making API requests in your preferred language

Once authenticated with your API key, you can send inference requests to the microsoft/Fara-7B model and receive responses based on your input prompts. Below are example placeholders showing how the model can be accessed using different programming environments.
You can choose the one that best fits your workflow.

from openai import OpenAI

# Initialize the OpenAI client with Qubrid base URL
client = OpenAI(
  base_url="https://platform.qubrid.com/v1",
  api_key="QUBRID_API_KEY",
)

# Create a streaming chat completion
stream = client.chat.completions.create(
  model="microsoft/Fara-7B",
  messages=[
    {
      "role": "user",
      "content": "Explain quantum computing in simple terms"
    }
  ],
  max_tokens=4096,
  temperature=0.7,
  top_p=1,
  stream=True
)

# If stream = False comment this out
for chunk in stream:
  if chunk.choices and chunk.choices[0].delta.content:
      print(chunk.choices[0].delta.content, end="", flush=True)
print("\n")

# If stream = True comment this out
print(stream.choices[0].message.content)

Model Overview

Fara 7B is a Computer Use Agent (CUA) model designed to take actions on the web to complete user goals. It operates by understanding browser screenshots, tracking previous actions, and deciding the next action required to move toward a task. The model predicts actions step-by-step instead of generating only text.

Model at a Glance

Feature	Details
Model ID	`microsoft/Fara-7B`
Provider	Microsoft
Architecture	Decoder-only Transformer
Model Size	7B params
Parameters	4
Context Length	8192 Tokens
Training Data	Mixed web, curated instructional datasets, code, and multilingual corpora
Dependency Model	Qwen 2.5-VL

When to use?

You should consider using Fara 7B if:

You are building browser-based automation
You need an agent that can take actions, not just generate text
Your workflow relies on screenshots + action history
You want step-by-step execution of user goals

Inference Parameters

Parameter Name	Type	Default	Description
Streaming	boolean	true	Enable streaming responses for real-time output.
Temperature	number	0.7	Controls creativity and randomness; higher values produce more diverse output.
Max Tokens	number	4096	Maximum number of tokens the model can generate.
Top P	number	1	Nucleus sampling that restricts token selection to a probability mass threshold.

Key Features

Computer Use Agent (CUA) : Designed to take actions on the web to accomplish high-level user tasks.
Multimodal Input : Uses browser screenshots along with text and action history to decide next steps.
Step-by-Step Action Execution : Predicts actions with grounded arguments such as coordinates for clicks.
On-Device Execution : Provides privacy guarantees and lower latency.

Execution Constraints

The model stops execution at critical points, including:

Entering personal information
Completing purchases
Making calls
Sending emails
Submitting applications
Signing into accounts

Summary

Fara 7B is a 7B parameter Computer Use Agent developed by Microsoft Research.

It performs web-based tasks by understanding browser screenshots and text context.
The model executes actions step by step using prior action history.
Outputs include internal reasoning followed by tool calls for execution.
It is designed for automated web workflows with on-device execution support.

Getting started

GPU Compute

Inferencing

AI Tools

About the Provider

Model Quickstart

Model Overview

Model at a Glance

When to use?

Inference Parameters

Key Features

Execution Constraints

Summary

Getting started

GPU Compute

Inferencing

AI Tools

​About the Provider

​Model Quickstart

​Model Overview

​Model at a Glance

​When to use?

​Inference Parameters

​Key Features

​Execution Constraints

​Summary

About the Provider

Model Quickstart

Model Overview

Model at a Glance

When to use?

Inference Parameters

Key Features

Execution Constraints

Summary